Improving Spoken Dialogue Understanding Using Phonetic Mixture Models
نویسندگان
چکیده
Reasoning about sound similarities improves the performance of a Natural Language Understanding component that interprets speech recognizer output: we observed a 5% to 7% reduction in errors when we augmented the word strings with a phonetic representation, derived from the words by means of a dictionary. The best performance comes from mixture models incorporating both word and phone features. Since the phonetic representation is derived from a dictionary, the method can be applied easily without the need for integration with a specific speech recognizer. The method has similarities with autonomous (or bottomup) psychological models of lexical access, where contextual information is not integrated at the stage of auditory perception but rather later.
منابع مشابه
Improving Spoken Dialogue Understanding Using Phonetic Mixture Model
Augmenting word tokens with a phonetic representation, derived from a dictionary, improves the performance of a Natural Language Understanding component that interprets speech recognizer output: we observed a 5% to 7% reduction in errors across a wide range of response return rates. The best performance comes from mixture models incorporating both word and phone features. Since the phonetic rep...
متن کاملA Phonetic Adaptation Module for Spoken Dialogue Systems
This paper presents a novel component for spoken dialogue systems, which adds the functionality of adapting the system’s speech output based on the user’s input. The adaptation in done on the phonetic level for adopting the user’s speech characteristics without changing the system’s own voice. An architecture for a spoken dialogue system is introduced, in which this module creates a direct link...
متن کاملImproving the speech recognition performance of beginners in spoken conversational interaction for language learning
The provision of automatic systems that can provide conversational practice for beginners would make a valuable addition to existing aids for foreign language teaching. To achieve this goal, the SCILL (Spoken Conversational Interaction for Language Learning) project is developing a spoken dialogue system that is capable of maintaining interactive dialogues with non-native students in the target...
متن کاملTowards End-to-End Spoken Dialogue Systems with Turn Embeddings
Training task-oriented dialogue systems requires significant amount of manual effort and integration of many independently built components; moreover, the pipeline is prone to errorpropagation. End-to-end training has been proposed to overcome these problems by training the whole system over the utterances of both dialogue parties. In this paper we present an end-to-end spoken dialogue system a...
متن کاملNew technique to enhance the performance of spoken dialogue systems based on dialogue states-dependent language models and grammatical rules
This paper proposes a new technique to enhance the performance of spoken dialogue systems which presents one novel contribution: the automatic correction of some ASR errors by using language models dependent on dialogue states, in conjunction with grammatical rules. These models are optimally selected by computing similarity scores between patterns obtained from uttered sentences and patterns l...
متن کامل